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Abstract 

The overlap fermion propagator is calculated on 2 + 1 flavor domain wall fermion gauge con- 
figurations on 16 3 x 32, 24 3 x 64 and 32 3 x 64 lattices. With HYP smearing and low eigenmode 
deflation, it is shown that the inversion of the overlap operator can be expedited by ~ 20 times 
for the 16 3 x 32 lattice and ~ 80 times for the 32 3 x 64 lattice. The overhead cost for calculating 
eigenmodes ranges from 4.5 to 7.9 propagators for the above lattices. Through the study of hyper- 
fine splitting, we found that the 0(m 2 a 2 ) error is small and these dynamical fermion lattices can 
adequately accommodate quark mass up to the charm quark. A preliminary calculation of the low 
energy constant A m i x which characterizes the discretization error of the pion made up of a pair of 
sea and valence quarks in this mixed action approach is carried out via the scalar correlator with 
periodic and anti-periodic boundary conditions. It is found to be small which shifts a 300 MeV 
pion mass by ~ 10 to 19 MeV on these sets of lattices. We have studied the signal-to-noise issue of 
the noise source for the meson and baryon. We introduce a new algorithm with Zj, grid source and 
low eigenmode substitution to study the the many-to-all meson and baryon correlators. It is found 
to be efficient in reducing errors for the correlators of both mesons and baryons. With 64-point 
Z% grid source and low-mode substitution, it can reduce the statistical errors of the light quark 
(m-K ~ 200 — 300 MeV) meson and nucleon correlators by a factor of ~ 3 — 4 as compared to the 
point source. The Z% grid source itself can reduce the errors of the charmonium correlators by a 
factor of ~ 3. 



I. INTRODUCTION 

A large scale endeavor has been undertaken by the RBC and UKQCD collaborations in 
the last few years to simulate 2+1 flavor full QCD with dynamical domain wall fermions 
(DWF) and Iwasaki gauge action on several lattices with pion mass as low as ~ 300 MeV 
and volume large enough for mesons (m^L > 4) jl|-|3|. Three sets of lattices 16 3 x 32 x Ls 
and 24 3 x 64 x L s at a' 1 = 1.73(3) GeV and 32 3 x 64 x L s at a" 1 = 2.32(3) GeV with the 
fifth dimension Ls = 16 are available, each with 3 to 4 sea quark masses with the lowest pion 
mass at ~ 300 MeV. With these lattices, one can proceed to perform chiral extrapolation 
and continuum extrapolation assuming a 2 dependence of the physical quantities. Since the 
domain wall fermion with Ls = 16 is a good approximation for the chiral fermion satisfying 
Ginsparg- Wilson relation, it is shown that they have good chiral properties and that most of 
the chiral symmetry breaking effects are absorbed in the residual mass which is reasonably 
small for these set of lattices. As such, these dynamical fermion configurations are very 
valuable and can be used to calculate physical quantities reliably, at least for the mesons. 
It is suggested from the study of the nucleon axial coupling g^ and electromagnetic form 
factors that the present lattices are still small and lattices with spatial dimension of 6 fm 
might be needed in order to control the finite volume errors. 

While combined chiral extrapolation and continuum extrapolation are being carried out 
with valence domain wall fermions, we shall explore the viability of employing valence overlap 
fermions on these DWF configurations. Both the domain wall fermion and the overlap 
fermion are chiral fermions. As such, they do not have 0(a) errors and non-perturbative 
renormalization via chiral Ward identities or the ROM/RI scheme can be implemented 
relatively easily J4H6[. Furthermore, the overlap fermion has additional desirable features 
which one can take advantage of in order to improve chiral symmetry as well as the quality of 
the numerical results. First of all, the numerical implementation of the overlap fermion allows 
a precise approximation of the matrix sign function so that the errors on the sign function 
and thus the residual mass can be as small as 10 -10 in practice [13J. The approximation 
to the exact chiral symmetry can also be gauged from the Ginsparg- Wilson and the Gell- 
Mann-Oakes-Renner relations. Its multi-mass algorithm permits calculation of multiple 
quark propagators covering the range from very light quarks to the charm on these sets 
of DWF lattices. This makes it possible to include the charm quark for calculations of 



charmonium and charmed-light mesons using the same fermion formulation for the charm 
and light quarks [7|. It is also possible to incorporate partially quenched data in the chiral 
extrapolation. Since the overlap operator is a normal matrix, it is easier to calculate its 
eigenmodes and implement low-mode deflation in the matrix inversion. As we shall see, this 
can speed up the inversion of small quark mass by more than an order of magnitude with 
no critical slowing down. Furthermore, these low frequency modes can be used together 
with the noise approximation of the high-frequency modes to construct all-to-all or many- 
to-all correlators. We shall show that using Z3 grid source on a time slice is quite efficient 
in reducing variance for both the meson and baryon correlators. We should point out 
that although both the overlap and domain wall fermions are chiral fermions, using overlap 
valence on DWF gauge configurations with HYP smearing at finite lattice spacing constitutes 
a mixed action approach. Mixed action approaches have been studied by many groups such 
as DWF valence on staggered fermion sea 8], overlap valence on DWF sea [9], overlap valence 



on clover sea [l0(] , and overlap valence on twisted fermion sea Uj . It is shown that the valence 
chiral fermion has the advantage that it introduces only one extra low-energy constant A m i X 
in the mass of the pseudoscalar meson with mixed valence and sea quarks which has the 
same effect as partial quenching. The mixed action partially quenched chiral perturbation 
theory (MAPQ^PT) which has been worked out for various physical quantities with various 
combination of fermion actions will be the simplest for the combination of overlap valence 
and DWF sea. 



This manuscript is organized as follows: The formalism for solving linear equation of 
the overlap operator with low eigenmode deflation and Z 3 noise grid for the many-to-all 
correlators will be given in Sec. 2. The numerical details on the tuning of the negative mass 
parameter p in the Wilson kernel of the overlap, the speedup due to HYP smearing and 
low-mode deflation, and the role of the zero mode will be presented in Sec. 3. In Sec. 4, we 
shall present the calculation of A m i X and results for the meson and nucleon correlators with 
point source, Z 3 grid source, and Z 3 grid source with low-mode substitution. The efficacy 
of the many-to-all approach will be discussed. We will finish with a summary in Sec. 5. 



II. FORMALISM 



The massless overlap operator [l2| is defined as 



D ov (p) = l+ 1 r e(H w (p)), (1) 

where e(Hw) = H\y / ' -J H^ is the matrix sign function and H\y is taken to be the hermitian 
Wilson-Dirac operator, i.e. Hw(p) = 75-CVG )- Here D w (p) is the usual Wilson fermion 
operator, except with a negative mass parameter — p = 1/2k — 4 in which k c < k < 0.25. 
As will explained later in Sec. IIH[ we will use k = 0.2 in our calculation which corresponds 
to p = 1.5. 

The massive overlap Dirac operator is defined so that at the tree-level there is no mass 
or wavefunction renormalization [13! ]. 

D{m) = pD O0 (p)+m{l-2^) 

= P+j + (p-j)K<H w (p)). (2) 

Throughout the paper, we shall use the lattice units for dimensionful quantities, except the 
lattice spacing a will be explicit in figures. 



Deflation 



Q 



It has been advocated [14| that using deflation with low eigenmodes can speed up inversion 
of fermion matrices. It has been applied to the hermitian syste m |15l Il6j ] to speed up the 
inner loop inversion of the overlap operator and to non-hermitian 17( and hermitian system 
with multiple right-hand sides [l8j. Low-mode deflation has also been applied to domain 



decomposition [l9j . In addition to speeding up inversions, substituting exact low eigenmodes 



in the noise estimation such as in quark loops [20|, |2l| and all-to-all correlators [22H24] 
has demonstrated that better results for the meson two- and three-point functions can be 
obtained with reduced errors. 

The massive overlap Dirac operator in Eq. ([2]) has the same eigenvectors as the massless 
one, we shall consider the massless Dirac overlap D ov . Due to the normality of D ov , i.e. 
D\ v D ov = D ov D\ v and the Ginsparg- Wilson relation {j5,D 0V } = D ov ^D OV) the eigenvalues 
of D ov are on a unit circle with the center at unity. The real and chiral modes are at and 



2. Others on the circle are paired with conjugate eigenvalues. In other words, if \i) is an 
eigenvector of D ov 

Aw|«) = Ai[z>, (3) 

then 75 \i) is also an eigenvector with eigenvalue A*, 

D m ^\i) = A* 75 |z). (4) 

To calculate the eigenmodes of D ov , one notes that due to normality and 75 hermiticity 
D\ v = TsAwTsj 75 commutes with D ov D\ v , i.e. [D ov D} )V ,'y 5 } = 0. Therefore, one can 
use the Arnoldi algorithm to search for eigenmodes of D ov Dl v with real eigenvalues |Aj| 2 
which are doubly degenerate with opposite chirality. To obtain the eigenmodes of D ov , 
one can diagonalize the two chiral modes in D ov . This is much easier than searching in 
the complex plane for the eigenmodes of non-normal fermions. Since the non-zero modes 
are conjugate pairs (Eq. pi)), we need only to save half of them, e.g. those with positive 
imaginary eigenvalues. When the eigenmodes are calculated, one can proceed with deflation 
by solving the high frequency part of the propagator 

D(m) \X» R ) = (l-P L )\r ]L , R ), (5) 

where Pl = Y^7=i "' N)( ? 1 * s the projection operator to filter out the low eigenmodes. n 
is the number of zero modes which are either all left-handed or all right-handed in each 
configuration. n\ is the number of non-zero low-frequency modes which come in conjugate 
pairs. In solving Eq. (J5]), we use the conjugate gradient solver (CGNE) for D(m)D*(m). In 
this case, one can utilize the property D ov D] )V = D nv + D\ v to save a matrix multiplication 



in each iteration with the chiral source \t]l,r) [Hi uM an d the solution \Xf R ) has the same 



chirality as the source. In this case, Eq. (j3j) can be written as 

no+ni 

D{m) \Xl R ) = \^ R )- ^(|z)(z|+7 5 |i)(i|7 5 )|^ jK )(l--5 Aij0 ) 

8=1 

n +ni 

= \Vl,r) - Y. (1 T7 5 )K)(^l,k)(1 - 2^,0), (6) 

where the sum is over the zero modes and the low modes on the upper half of the eigenvalue 
circle. Although we do not calculate it this way, the high frequency part of the propagator 
can be written as 

wh \ n -i/ w \ V^ r \V\ 1 \Vl,r) =F75 WWl.k) in l x x / 7 x 
\ x l,r) = D (rn,p) \vl,r) ~ }^ "T^ T\ a"T + — : Z a^tK 1 ~ 9^,0)- (7) 



The total high frequency part of the propagator will be, in the end 

\X H ) = \X? ) + \X» >, (8) 

given that the source \rj) = \t]l) + \t)r). 



To accommodate the SU(3) chiral transformation with Sip = T'y 5 (l — l/2D ov )ip [25] which 
leads to chirally covariant flavor octet quark bilinear currents in the form tpTT(l — ^D ov )ip, 
it is usually convenient to use the chirally regulated field tp — (1 — ^D ov )ip in lieu of ip in the 
interpolation field and the currents. This turns out to be equivalent to leaving unchanged 
the unmodified interpolation field and currents and adopting instead the effective propagator 

D'eh = (1 " ^)D-\m), (9) 



which also serves to filter out the unphysical eigenmode at A = 2p 26] . 

Defining S = D~h and S = Sh + Si, where Sh/ Sl is the high/low frequency part of the 
effective propagator, Sh originating from the source 77 can be obtained, after a few steps of 
derivation, as 

( x \s H \v) = Y, s »^y)<y) = (*\&~)\x H ) = (i+^—)(x\x h )--^—(x\(i-p l )\ v ). 

z — ' 2 2p — m 2p — m 

y 

(10) 
It is worthwhile pointing out from Eq. ( ITUj) that once \X H ) is solved, there is no need to 
explicitly multiply D ov on \X ) which involves an inversion of the kernel H^ in the Zolotarev 
approximation of the matrix sign function. Similarly, the low frequency part of S can be 
obtained from spectral decomposition 

D 
~2 



( x \s L \ v ) = J2 s ^yWy) = (*l(i - -f)\x L ) 



V 

no+ni / 1 A,\ i\ -\ /-i \ ft \* 



- L. i A + m( i-f) + PK + m[ i-±) l(1 - 2*» o) ' (11) 

Since the eigenmodes are available for the low frequency modes, one can obtain the all-to-all 
propagator for this part of the spectrum 

^ pAi + m(l-f) p A| + m(l-^) 2 

In the above expressions, we have suppressed the Dirac and color indices. 



B. Low-mode substitution 



It is shown 22l424| that when noise is used to estimate the meson two- and three-point 
correlation functions in the connected insertion, substituting the noise estimated low-mode 
part of the correlator with the exact one improves statistics. Consider, for example, the 
meson correlator from the local interpolation fields Oj = ^>IY0 where the two-point correlator 

is 

C(t,p;v) = J2 e ~ iP ^( TT ^ S ^^y^Mm2l 5 S\x,t;y,0)r ] \y) l5 ]), (13) 



x,y 



where the source r] is on the t = time slice with support on {y}. Since the quark propagator 
5* is composed of the low- frequency and high-frequency parts S = Sh + Sl, the two-point 
correlation function can be decomposed into the following 



C — Chh + Chl + Clh + Cll, 



(14) 



where 



C LL = J2 e- irts ^{T l S L {x,t-y^)r t {m2l,S[{x ) t- y>, 0)rf(y>) l5 ). 



%,y,y 



The noise r] has the property 



(r](x)rf(y)} = 4, 



u- 



(15) 



(16) 



where (...) is the noise average. The standard error due to the noise estimation of the 
correlation function averaged over the gauge configurations is given by 271, 




N n N g 



(17) 



where (y 2 g jo\ is the variance of the gauge/noise ensemble, and N g /N n is the number of 
gauge/noise configurations. How good the approximation is depends on the noise estimation. 
Through the numerical study of the quark loop for the energy-momentum tensor with the 



Wilson fermion on quenched gauge configurations, it is learned [28[ that a n is much larger 
than o g with a n /a g ~ 27(46) for the case with (without) 4-term unbiased subtraction. As 
such, it is desirable to replace the noise estimate of Cll with the exact correlator to reduce 
variance due to the noise estimation. The low-mode substituted meson correlator is then 



C(t,p) sub — C — Cll + Cll, 



(18) 



with 

C LL = J2 e-^ (£ ~ ?7) Tr(r 1 5 L (f , t; y, 0)T 2l5 Sl(x , t; y, 0) 75 ), (19) 

x,yeG 

where the sum of y runs over the set G with the same support on {y} as the noise r\. 

In the case of baryon, one can use the Z 3 (e l27Tk ^ 3 , k = 0, 1, 2) noise on a time slice for the 
quark source due to the property 

{v(x)v(y)v(z))n = $x,y5 y , z , (20) 

so that it is an approximation for the superposition of multiple baryon source with three 
quarks in each of the baryon originating from the same spatial location on the support of 
the Z 3 noise. 

Similar to the meson case, one can substitute Clll, which is the part with all three 
quarks estimated by SlI], with Clll where all three quark propagators are given in terms 
of Sl- In addition, one can replace the Cell part, where one of the quark propagators is 
ShV and the other two are SlI], by Cull in which the product of the two Sltj is replaced 
by Y.y S l(x, t; y, 0)S L (x', t; y, 0)^(y). 

C(t,p) sub = C- Clll + Clll ~ P{C H ll} + P{C H ll} (21) 

where P{} refers to the set of correlators with permutation of ShV an d SlT] (or 5^) for the 
three different quarks in the baryon. It is worthwhile pointing out that Cull is like Chl 
in the meson in the sense that the error due the noise is from a n associated with Eq. (TT6I) . 
To the extent that the baryon correlator is dominated by Clll and P{Chll\ in the time 
window where the ground state baryon emerges, the variance reduction with the substitution 
in Eq. ( 12T|) is expected to be similar to that of the meson case. 

III. NUMERICAL DETAILS 

The overlap propagators are calculated on three sets of lattices of the 2 + 1 flavor domain 
wall fermion gauge configurations with the four-dimensional sizes of 16 3 x 32, 24 3 x 64 (a" 1 = 
1.73(3) GeV), and 32 3 x 64 (a -1 = 2.32(3) GeV) with several sea quark masses each. These 
are generated by the RIKEN-Brookhaven- Columbia (RBC) collaboration and the UKQCD 
collaboration l|-|3) . The matrix sign function in the overlap Dirac operator is calculated with 



14th degree Zolotarev rational polynomial approximation 



29(]. For the window [0.031, 



2.5], the approximation to the sign function is better than 3.3 x 10~ 10 [13]. This is sufficiently 
accurate as the low-mode deflation is used in the inversion of H\y with HYP smearing in the 
Zolotarev approximation, which is the inner loop of the inversion of the overlap operator, 
and the largest absolute values of the low mode eigenvalues are 0.2, 0.125, and 0.22 on 
16 3 x 32, 24 3 x 64, and 32 3 x 64 lattices with 100, 400, and 200 eigenvectors, respectively. As 



shown in Figs. 1(a), 2(a), and 3(a), the largest absolute values of the projected eigenvalues 
on the HYP smeared configurations are larger than 0.031, the threshold for high accuracy 
of the approximation of the sign function. 
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(a) (b) 

FIG. 1. (color online) (a) The spectra of the lowest 100 eigenvalues for the kernel in the inner loop 
of the overlap fermion for a 16 3 x 32 configuration with mi = 0.01. (b) The same as (a) for the 
lowest 200 eigenvalues of the outer loop overlap fermion. The unsmeared spectra are colored in 
grey and the HYP smeared spectra are colored in red. 



A. Speedup of propagator calculation 



We employ HYP smearing 30] on the gauge links which has the effect of depleting the 
density of the lowest eigenvalues in H w [3l|. As a result, the lowest eigenvalue with HYP 
smearing after deflation with 100 to 200 eigenmodes is about 3 times larger than those 
without HYP smearing. This leads to ~ 3 times speedup in the number of inner loop CG 
iterations as was found in a previous study (3l| . This is tabulated in Table [B In addition, 
for the three lattices under study, the numbers of H\y/Dov eigenmodes used for deflation 
in the inner/outer loop, the numbers of inner and outer iterations for the cases without 
deflation, with deflation, and with both deflation and HYP smearing. For the comparison 



10 




k 



-0.10 -0.05 0.00 0.05 0.10 




(a) (b) 

FIG. 2. (color online) (a) The spectra of the lowest 400 eigenvalues for the kernel in the 
of the overlap fermion for a 24 3 x 64 configuration with mi = 0.005. (b) The same as 
lowest 200 eigenvalues of the outer loop overlap fermion. The unsmeared spectra are 
grey and the HYP smeared spectra are colored in red. 
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FIG. 3. (color online) (a) The spectra of the lowest 200 eigenvalues for kernel in the 
of the overlap fermion for a 32 3 x 64 configuration with mi = 0.004. (b) The same as 
lowest 400 eigenvalues of the outer loop overlap fermion. The unsmeared spectra are 
grey and the HYP smeared spectra are colored in red. 
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study, we used light sea mass at mi = 0.01, 0.005 and 0.004 for the 16 3 x 32 (latticel6), 24 3 x 
64 (lattice24) and 32 3 x 64 (lattice32) lattices which are respectively the lowest light sea mass 
in these three lattice sets. For the valence quark, we used the quark mass which corresponds 
to the pion mass at ~ 200 MeV in all three cases. We see from Table [I] that the inner 
loop iteration number is reduced by a factor of ~ 3 due to HYP smearing. One can see 



from Figs. 1(a), 2(a), and 3(a) that this is due to the fact that after projecting out the 
small eigenvalues of H^ the resultant lowest eigenvalue with smearing is about a factor of 3 



11 



larger than those without smearing. On the other hand, the number of outer iterations are 
greatly reduced due to deflation of the low D ov eignemodes. It is interesting to note that the 
number of outer iteration for deflation with the smearing case is ~ 18%— 25% higher than the 
corresponding case without smearing. This is due to the fact that after HYP smearing, the 
imaginary part of the highest eigenvalues of the deflated eigenmode X max at 0.0707 ±i 0.434 
(lattice 16), 0.00857 ± i 0.153 (lattice 24) and 0.0115 ± i 0.186 (lattice 32) are 46%, 51%, 
and 22% smaller than those of the corresponding eigenvalues without smearing. This can 



be seen in Figs. 1(b), 2(b), and 3(b). In the end, when the total number of iterations are 
compared, the speedup with deflation and smearing to the cases without them are 23, 51, 
and 79 times respectively for the three lattices for one test configuration each. The three 
selected configurations have zero modes. We have also tested configurations without zero 
modes. It turns out the total numbers of iterations for both the cases with and without 
smearing and deflation are about the same as those of configurations with zero modes. This 
is so because the absolute eigenvalues of the lowest eigenmodes for configurations without 
zero modes (which are of the order 10~ 2 for the 16 3 x 32 lattice, 10~ 3 for the 24 3 x 64 lattice, 
and 10 -4 for the 32 3 x 64 lattice) are comparable to the smallest quark masses on these 
lattices. 

We should point out that the absolute values of the above-mentioned X m ax of the low- 
frequency modes on these lattices are much larger than the small valence quark masses 
(ranging from 0.0014 to 0.01, say) so that the small valence quark does not affect the speed 
of inversion and, thus, there is no critical slowing down for the light valence masses in 
inversions with low-mode deflation. Also listed is the overhead for producing eigenmodes of 
the overlap fermion for deflation. The cost is in the range of 4.5 to 7.9 propagators with both 
deflation and HYP smearing (D + S). This cost is to be amortized when more propagators 
are needed in calculations such as three-point functions and quark loops. To compare with 
inversion of the Wilson-type fermion, we timed the inversion of the clover fermion on 10 
2 + 1 flavor dynamical clover configurations with a size of 32 3 x 64 and pion mass of 156 
MeV at lattice spacing a = 0.09 fm from the PACS-CS Collaboration 32J. Using the CG 
solver with odd/even preconditioning and no HYP smearing, we find the average inversion 
of one propagator with the same residual of 10 -8 takes ~ 11892 iterations. Taking the the 
product of the inner and outer loop iterations for the case of lattice 32 in Table [U the overlap 
inversion with smearing and deflation has about 32% more iterations than that of the clover 

12 



TABLE I. Speedup comparison of inversion with HYP smearing (S) and deflation (D) of the outer 
loop. The inner and outer iteration numbers are the average of one column in one propagator with 
12 columns of color-spin. The speedup refers to that between the case of D + S vs the one with 
neither D nor S. The overhead of producing eigenmodes is measured in terms of the propagators 
with D + S calculation. 
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fermion in this case. 



B. Tuning of p 



We carry out the valence quark propagator calculation with 30 quark masses which cover 
the range from the physical pion mass to the charm mass. There will be a concern about 
the large finite volume effect for the pion mass as low as the physical one, we shall use those 
below m^ = 200 MeV for the finite volume study not in chiral extrapolation. To include the 
charm mass entails making sure that the heavy mass will have small enough 0(m 2 a 2 ) error 
to warrant reliable calculation for the charmonium and charm-light mesons. To this end, we 
fine-tune the negative mass parameter p in Eq. ([2]) in the range 1 < p < 2 to minimize the 
0(m 2 a 2 ) error. We conducted a test on 10 16 3 x 32 (mi = 0.01) configurations for the range 
of 1.059 < p < 1.917 to check the speed of inversion and 0(m 2 a 2 ) error assessed with the 
hyperfine splitting (the difference between the vector and pseudoscalar meson masses). It 
turns out that p = 1.5 is close to the optimal choice. It has about the fastest inversion and 
its m 2 a 2 error as measured by the hyperfine splitting in the charmonium is the smallest. To 

13 



illustrate what one means to have the smallest 0(m 2 a 2 ) error, we plot the hyperfine splitting 
as a function of ma for the case of p = 1.5 and 1.62 in Figs. H] and |5] for comparison. The 



hyperfine splittings for p = 1.5 and 1.62 are plotted in Figs. 4(a) and 5(a) as a function 



of m. In view of the fact that the excitation scales for the charmonium and the upsilon 
as measured from the 2S to IS and the average 1 3 P to 1 3 S splittings are about the same, 
it is argued 33[ based on non-relativistic Schrodinger equation that the size of the heavy 
quarkonium should scale as 



Tnn OC 



1 



m 



(22) 
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(a) (b) 

FIG. 4. The hyperfine splitting is plotted as a function of ma in (a) and 1/y/ma in (b). This is for 
p= 1.5. 
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FIG. 5. The hyperfine splitting is plotted as a function of ma in (a) and 1/y/ma in (b). This is for 
p= 1.62. 
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This prediction is checked against the leptonic decay widths, the fine and hyperfine split- 
tings 33] of charmonium and upsilon. In the case of the hyperfine splitting, the perturbative 
spin-spin interaction has the form -^p- ^'^ [^(O)) 2 , where ^(O) is the vector meson wave- 
function at the origin which is proportional to r Q -l . Thus, the hyperfine splitting of the 
heavy quarkonium is expected to scale like 

AE HFS cc -=. (23) 

\/m 



to leading order in m. Based on this observation, we plot the HFS for p = 1.5 and 1.62 in 



Figs. 4(b) and 5(b) in terms of 1/y/m. We see that in both cases, the HFS trends toward 
zero as m is heavier than 0.4, except for a few points at the heavy end which show large 
deviation from the trend. We interpret this as due to the lattice 0{m 2 a 2 ) error. To find a 
threshold of usable range of m where the estimated 0(m 2 a 2 ) error is negligible (or smaller 
than the statistical error), we fit the HFS to the form 

AE HFS = ^(1 + -), (24) 



which includes the next term in large m expansion. From the fit in the range m = 
0.4/0.6 to 0.8 for p = 1.5/1.62 (which correspond to 1/y/m = 1.58/1.29 to 1.12), we find 
a = 0.0769(6)/0.0690(7) and b = -0.0002(19)/0.0004(29) for p = 1.5/1.62. We see that the 
central value of b/m is much smaller than its error and is thus consistent with zero in both 
cases. At the heavy-mass end, the central values of the HFS are outside the fits. We find 
that at m — 0.88/0.75 for p = 1.5/1.62, the central value is beginning to be more than 2 
a away from the fitted HFS curve. We take it to be the critical value beyond which there 
is discernible 0(m 2 a 2 ) error. This suggests that, given the same relative deviation, p = 1.5 
has a longer range of usable m than p = 1.62. Thus, we decide to adopt p = 1.5. In this 
case, the charm quark is at ~ 0.73 where the central value of the HFS is consistent with 
the fitted curve well within one sigma (relative error is about 1%). Through study of the 
0(m 2 a 2 ) error with different lattice spacings for the overlap fermion, it is found 34J that 
the critical mass is insensitive to the lattice spacing and depends mostly on ma. For the 
32 3 x 64 lattice at a -1 = 2.32 GeV, the charm mass is at m ~ 0.48 which is much smaller 
than the critical mass ma = 0.88. 

We conclude that we can cover the quark mass range from light all the way to the charm 
with the overlap fermion on the three sets of DWF configurations under study. The critical 
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mass of ma ~ 0.88 for a discernible 0(m 2 a 2 ) error in the HFS is higher than that of the 



quenched case where the 0(m 2 a 2 ) error becomes appreciable (5%) when ma ~ 0.5 26]. 
This is presumably due to HYP smearing that is adopted in the present calculation with 
dynamical fermion confi gura tions. HYP smearing is also known to improve the locality of 



the overlap operator [3ll 135 ]. 



C. Zero mode issue 

The role of zero modes and topology at finite volume has been discussed extensively in 
]. The calculation of the quark condensate (ipip) with the chiral fermion 



the literature 



contains a term from the zero mode contribution -^r where Q is the topological charge of 
the configuration which, according to Atiya-Singer theorem, equals the difference between 
the numbers of left-handed and right-handed zero modes. This term vanishes in the limit 
V — > oo ((|Q|) grows as y/V) for finite m. It is shown [36] that as long as one is working in the 
region where mHV 3> 1, the zero mode contribution to the quark condensate is negligible 
as the number of zero modes per unit volume goes to zero when the volume approaches 
infinity. Based on this and the generalized Gell-Mann-Oakes-Renner relation 

i /"d 4 ^A(7r a (x)V(|/)) = ~{fal>), (25) 



where 7i a = ip^r" 1 /2ip, it is suggested |37J, |38[ that, as long as mT.V ^> 1, the contribution of 
the zeros to the pseudoscalar correlator is negligible. In this case, one expects to obtain the 
pseudoscalar mass from the exponential fall off of the correlator. To test this idea, we plot 
in Fig. [6] the correlators for the pseudoscalar masses at ~ 200, 350, 700 and 2980 MeV. on 
the 16 3 x 32 lattice for one gauge configuration with and without zero mode contributions 
(there are two zero modes in this configuration) for the purpose of illustration. We see that 
when the pion mass is as low as 200 MeV, where mHV ~ 1.8 is not much larger than unity, 
the pion correlator is greatly affected. The pion mass may not change very much, but the 
spectral weight is reduced by an order of magnitude when the zero modes are taken out. 
When the pion mass is ~ 350/700 MeV where mY,V ~ 5.5/22, we see that the correlators 
are not affected much when the zero modes are taken out. This seems to conform with the 
above idea. However, when we plot the correlator without the zero modes and examine the 
heavy quark case where mT.V ^> 1 is satisfied, we see that the correlators with and without 
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the zero modes differ by several orders of magnitude at large time separation, e.g. t > 10. 
This is so because the zero mode contribution normalized in the present volume is of the 
order > 10 -8 at large time separation; whereas, the signal for the pseudoscalar meson falls 
off exponentially with respect to time. Sooner or later, the signal will fall below the zero 
mode contribution. In other words, the zero mode at any finite volume is part of the physical 
spectrum. Except for quark condensate and other rare cases, one cannot separate out the 
zero mode contribution from the rest of the spectrum for physical observables in general. 
Even though it may not make much of a difference numerically for the meson correlator at 
relatively short range of t when mJ^V 3> 1, the large time separation will sooner or later be 
affected and this problem is more acute for the heavy quark. 
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FIG. 6. (color online) The pseudoscalar meson correlators for pion masses at ~ 200, 350, 700 and 
2980 MeV which correspond to the input quark masses at 0.006, 0.0172, 0.067, and 0.73 for a 
configuration of the 16 3 x 32 lattice with and without zero mode contributions. 



IV. RESULTS 

Since the overlap fermion is calculated on configurations generated with domain wall 
fermions, this constitutes a mixed action approach to chiral fermions. Mixed action ap- 
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proaches have been studied by several groups, such as DWF valence on staggered fermion 
sea [8 1, overlap valence on DWF sea [9j, overlap valence on clover sea 10], and overlap 



valence on twisted fermion sea 



ll| . In view of the fact that it is numerically intensive to 



simulate chiral fermions (DWF or overlap), it is practical to use the cheaper fermion for- 
mulation for generating gauge configurations and the chiral fermion for the valence as an 
expedient approach toward full unquenched QCD simulation with chiral fermions. Many 
current algebra relations depend only on the chiral property of the valence sector. The 
mixed action theory with different fermions for the valence and the sea is a generalization 
of the partially quenched theory with different sea and valence quark masses. The mixed 
action partially quenched chiral perturbation theory (MAPQ^PT) has been developed for 
Ginsparg- Wilson fermion on Wilson sea 39| and staggered sea 40j, and has been worked out 
for many hadronic quantities to next-to-leading order (NLOV, such as pseudoscalar masses 



and decav constants 



aa-kl 



41], isovector scalar a correlator 42l445| . heavy-light decay con 



stants [46J , and baryon masses |45l. 147]. 

In the mixed action chiral perturbation theory with chiral valence fermion, it is shown [39 1 
that to NLO, there is no 0(a 2 ) correction to the valence-valence meson mass due to the 
chiral symmetry of the valence fermion. Furthermore, both the chiral Lagrangian and the 
chiral extrapolation formulas for hadron properties to the one-loop level (except ^-dependent 
quantities) are independent of the sea fermion formulation 48| . The LO mixed-action chi- 
ral Lagrangian invokes only one more term with 0(a 2 ) discretization dependence which is 
characterized by a low energy constant A mix . The LO pseudoscalar meson masses are given 
as 



m,. 



m. 



B ov (m v H-rrv), 

B ov m v + B dw (m s + m res ) + a 2 A r 



m Qe/ = B d ,Jm s + 7?v + 2m r 



(26) 



where m vv > /m ss i is the mass of the pseudoscalar meson made up of valence/sea quark and 
antiquark. m vs is the mass of the mixed valence and sea pseudoscalar meson. Up to nu- 
merical accuracy, there is no residual mass for the valence overlap fermion. The DWF sea 
has a residual mass m res which vanishes as L$ — > oo. The A m j X enters in the mixed meson 
mass m vs and is an 0(a 2 ) error which vanishes at the continuum limit. We should note 
that, unlike the partially quenched case, even when the quark masses in the valence and 
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sea match, the unitarity is still violated due to the mixed action. The degree of unitarity 
violation at finite lattice spacing depends on the size of A m i X . 

A. Calculation of A m i x 

A m ,; r has been calculated for pseudoscalar mesons for DWF valence and staggered fermion 



sea 



44 



49J which gives A mix ~ (708MeV) 4 [49J and ~ (664MeV) 4 ± (437MeV) 4 |44j. It 



is also calculated for overlap valence and clover sea which yields A m i X = (872MeV) 4 ± 

a n 

(693MeV) [50]. This means that for a valence pion of 300 MeV, the A mix produces, at 
a = 0.12 fm, a shift of ~ 102 — 251 MeV for these cases which is quite large. 

Here we shall estimate A m i X in our case with overlap valence on DWF sea. To do so, 
we shall examine the meson state which wraps around the time boundary. It is known 
that a two meson interpolation field can produce meson states with two mesons propagating 



along opposite time directions 51M53J. On the other hand, the ao isovector scalar meson 



interpolation field ud, together with the quark loop from the sea, can produce a ir and r)(r)') 



propagating in different time direction [54|. This is illustrated in Fig. [7] where Fig. 7(a 



displays the situation with both the valence and the sea quarks wrapping around the time 



boundary forming two pions propagating in different time directions; whereas, Fig. 7(b) 



shows the annihilation diagram where the valence quark- ant iquark pair wraps around the 



time boundary while the sea quark loops do not. Together, Fig. 7(a) and Fig. 7(b) form an 
opposite-time propagating tc and r](r]') pair. The a correlator has, thus, the following form 

C ao = J2 Wi(e- Ea + e-^ (T - t} ) + W wv (e- m ^- m ^ T - t] + e -™r,*-™AT-t)^ 

= J2 2W t e~ E * T/2 cosh(E i (T/2 - £)) 

i 

+ 2W nv e- {m " +m ^ T/2 cosh((m„ - m n )(T/2 - £)), (27) 

where E% is the energy of one- or two-meson state which propagates in the same time direc- 
tion. Notice that the second term in Eq. (T2"T|) . which represents the r\ and n wrapping around 
the time boundary, has an exponential falloff which is proportional to the mass difference of 







rj and tt, i.e m v — m n . Since this is smaller than all the other states in the ao correlator I 1 ] it 
1 We should note that there are states where tt and rj move back to back with non-zero momenta which 
have smaller energy differences, but they will be suppressed by the corresponding pre-factor e -(. E -"+ E i) T / 2 
as compared to e -( m ^+ m >?) T / 2 i n Eq. (|27|l for the the zero-momentum tt — r] state for large T. 
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appears as the lowest state in the longest time separation in the correlator, 
state causes problems for fitting the scalar correlator to obtain the ao meson 



his low-lyinj 
54fl. 





(a) (b) 

FIG. 7. Cartoon showing the quark lines which form wrap-around it and rj mesons in the ao 
correlator, (a) Both the valence and the sea wrap around the time direction and form two pions 
which are propagating in different time directions, (b) The annihilation diagram where only the 
valence wraps around the time boundary. Together with (a), it produces it and r/(rj') propagating 
in different time directions. 

We shall take advantage of the existence of this state due to the finite time extent to 
extract A mix . As we see in Fig. [71 the structure of Fig. 7(b) is complicated. Yet, Fig. 7(a) 



is simple in that it involves the mass difference of two pions, not r\ and it as in Eq. (1271) . For 
the equal valence quark mass case, the mass difference of the two pions vanishes and one 
does not obtain any information about A m i X . But if the two valence quark masses are not 
the same, the mass difference of the two pions becomes 



m VlS -m. 



('_> s 



y / B ov m Vl + B dw (m s + m res ) + a 2 A mix -^J 'B ov m V2 + B dw (m s + m res ) + a 2 A mix , 

(28) 
where m VlS /m V2S is the pion made of quarks with i>i/i>2 overlap fermion and the s domain-wall 
fermion. In this case, one can extract A mix . To do so, one needs to remove the annihilation 



diagram in Fig. 7(b) and all the states which propagate in the same time direction. This 



can be achieved by calculating the valence propagators with both anti-periodic and periodic 
boundary conditions in time (the DWF sea has anti-periodic B.C.) and take the difference 



of the two correlators. For the annihilation diagram in Fig. 7(b) , the valence quark traverses 
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the time boundary an even number of times so that it is independent of the time boundary 



condition. On the contrary, the valence quark in Fig. 7(a) traverses the time boundary 



an odd number of times. So, taking the difference between correlators with periodic and 

as 



anti-periodic time B.C. for the valence cancels out the annihilation diagram in Fig. 7(b) 



well as the contribution from states propagating in the same time direction and one is left 



with the contribution in Fig. 7(a 



As a first attempt to extract A m j X , we consider the difference of scalar correlators from 
the 24 3 x 64 DWF lattice (light sea mass at mi = 0.005) with periodic and anti-periodic 
B.C. in time. In this case, the difference correlator at large time will be given by 



AC, 



o0 



cz -C 



U() 



AP 



AW e 

^ * ' 7T17T2 



< m ^ +m ^ T ^cosh{(m vls - m V2S )(T/2 - £)}. (29) 



As a first check, we plot in Fig. |8] such a difference correlator for the equal valence case 
(i.e. m Vl = m V2 ). We expect from Eq. ( 129]) that the correlator should be independent of t. 
As we see in Fig. [8] where such correlators are plotted for m Vl = m V2 = 0.0203 and 0.0489 
which correspond to pion masses at ~ 372 and 577 MeV, the correlators are indeed quite 
flat. This is consistent with our expectation that the large t behavior depends on the mass 
difference m VlS — m V2S which is zero in this case. To extract A^, we want to find a range 
of quark mass where the tree-level linear mass relation between m\ v and m v holds so that 
we can use Eq. ( 126]) . We plot m% v and m 2 vv jm v from the 24 3 x 64 lattice with mi = 0.005 as 
a function of m v in Fig. [9] 



m,, - 372 Mev 
rrv ~ 577 Mev 



I#*lfi5ffiffi 



Sfi/ : 



10 20 30 40 50 60 



FIG. 8. The difference of the scalar (ao) correlators with anti-periodic and periodic time boundary 
conditions for the two equal valence masses which correspond to m n = 372 and 577 MeV. 



We see that the ratio ml v a 2 /m v a in Fig. 9(b) is fairly flat for the range m v a ~ 0.0203 — 
0.0489. We shall take m Vl and m V2 from this range and fit the correlators to find m VlS — m V2S 
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which can be expressed in terms of the corresponding pseudoscalar masses and A r 



[m: 



+ ml a ) + a 2 A mix - 



1 



m 2 + m 2 ss ) + a 2 A r 



(30) 



^From the separately calculated m vlVl and m V2V2 with the valence overlap fermion and m s 
calculated with DWF [2J, we can extract A m i X . 
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FIG. 9. (a) m 2 is plotted as a function of ma for the 24 3 x 64 lattice with m = [0.00275, 0.15]. (b) 
m^/ma(GeV) 2 is plotted vs ma for the same range of quark masses. 

Since the range of m v a ~ 0.0203 — 0.0489 is narrow and m VlVl and m V2V2 are close, the 
errors on the extracted A m i X from Eq. (1301) are large. We take several combinations of m VlVl 
and m V2V2 in the range [0.0203, 0.0489] and obtain an average A mix 



a 2 A r 



-0.0112(44) GeV 2 



for the 24 3 x 64 lattice with 50 configurations at mi = 0.005. With a 



(31) 



1.73 GeV, we 



obtain 



-(427MeV) 4 ± (338MeV) z 



(32) 



This is quite small. To compare with those from other mixed actions, we notice that the 



central value is ~ 7 times smaller than the case of DWF valence on staggered sea [44J, |49] 
and ~ 18 times smaller than that of overlap on Wilson sea [10]. To put the magnitude in 
perspective, consider a 300 MeV pion on the 24 3 x 64/32 3 x 64 lattice with a ~ 0.12/0.085 
fm, the shift in mass due to A mix is ~ 19/10 MeV which is substantially smaller than the 
~ 102 — 251 MeV for the other mixed actions as alluded to earlier. As mentioned above, the 
calculation of A mix using the boundary condition method gives large errors. At this stage, 
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we are more interested in finding out how large A ma is roughly to see if it is practically 
small enough to carry out chiral extrapolation with MAPQxPT. For a more precise value, 
we shall use the mixed valence DWF and overlap propagators to directly evaluate A mix and 



check scaling as is done in Refs. [10J, 



44 



49|]. 



Coming back to the correlators in Fig. El we notice that the magnitudes of these two 
correlator differ by two orders of magnitude, a feature which is unusual for meson correlators 
with pion masses which are not that different. In this case, we note that there is a pre-factor 
e -(m via +m V2a )T/2 j n p^ (J29|) which can be sensitive to slightly different pion mass when T 
is large. Since we expect the ratio of the spectral weight W win2 in Eq. (1211 for the two 
correlators with pion masses at ~ 372 and 577 MeV to be within ~ 20% from unity when 
the mass dependence of the matrix element and the normalization factor are taken into 
account, the primary difference of the correlators in Fig. [8] for the equal masses case should 
come from the exponential pre-factor. Taking A m i X into account, the ratio of the pre-factor is 
88(13) with T = 64. This turns out to be quite close to the jackknife ratio of the calculated 
correlators in the time range [11,54] which is 83(9). This lends further support for the 
existence of the wrap-around states. 



B. Z3 grid source and low-mode substitution 

Noise has been used to estimate quark loops as well as propagators in the all-to-all 
correlators. In particular, Z 2 noise has been introduced to estimate the quark loops [55j and 



it is shown 



55 



561 ] that its variance is minimal since, unlike the Gaussian noise, it receives 



no contribution from the diagonal matrix elements of the quark propagator. Complex Z 2 (or 
Z4) has been adopted in many quark loop calculations [57H60J and the stochastic estimation 
of determinants (61J. However, the volume Z 2 noise is not a good estimator for connected 
insertion calculation where the hadron correlator C(t, 0) is needed for large time separation. 



In this case, the signal falls off exponentially (C(t, 0) 



~ e 



-mt^ 



, yet the variance decreases 



only as the inverse power of the noise number 55[. To alleviate this difficulty, dilution of the 
noise is suggested so that the noise is applied to one time slice at a time and supplemented 



with low-mode substitution 
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24|. 



In the following, we shall consider the Z 3 noise which can be used for baryons as well as 
mesons. 
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1. Signal-to-noise ratio 

We should first remark that the noise wall source, by itself, does not reduce errors as 
compared to the point source. To see this, we shall consider the variance of the meson and 



baryon correlators. Besides the large time behavior first considered by Lepage [62|, there 
are pre-factors associated with the noise source. The meson correlator in Eq. (fT3j) with a 
noise wall source has the following behavior at large t, 

C M (t,p = 0)~V 3 e- m ^, (33) 

where V3 is the three-volume of the noise with its support on a time slice. This comes from 
the noise estimate with J^- jtiv^fflviif)) ^ ^3- Thus, the signal from the noise estimator is 
larger than that of a point source by a factor of V3 according to our normalization convention 
of the noise. On the other hand, the variance of the correlator at large t is 

Na 2 M (t) « (G M (t) 2 ) - (G M (t)) 2 , (34) 

where N = N g xN n and Gm($) is the meson propagator in each gauge and noise configuration 
as defined in terms of the meson correlator in Eq. f lT3|) . i.e. 

C{t) = (G M (t)). (35) 

In the case of the flavor non-singlet meson, the lowest energy state in the variance correlator 
of Eq. ( 134")) is about the mass of two pions. The noise from the first term contributes a 
volume squared factor from the four quark propagator, i.e. 

J2 (rffflv(y)rf&)v$)) « v 3 2 . (36) 

a;,y,x',y' 

Therefore, at large t 

a M {t) « -7J=e-™-* (37) 

The signal to noise ratio is 

Cm ^P = ^ « y/N e -(<"»-«**. (38) 

0-M(t) 

The volume factor cancels out and there is no gain in statistics with the noise wall source 
as compared to a point source. For the pion correlator, the signal-to-noise ratio is nearly 
constant at large t as noted before |62| . 
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The baryon case is different. Consider the nucleon correlator which has the generic form 

C N (t,p = 0) ~ (S(t,0)S(t,0)S(t,0)), (39) 

where S(t, 0) is the u/d quark propagator and it is produced with a Z 3 wall source. We have 
suppressed the associated 7 matrices in this expression. At large t, 

C N (t,p = 0)^V 3 e- m » t . (40) 

The V 3 factor comes from the sum of the noises at the source end, 

Y^(v{Z)v{y)vW) = Vs. (41) 

x,y,z 

As for the variance 

No%(t) « (&{t,0)&*{t,0)) -C 2 N (t,0), (42) 

the lowest mass state in the first term at large t is 3 pions which is lower than that of the 
second term which falls off like e _2mjvi . Besides the large time behavior, the variance has a 
pre-factor from the noise 

^2 (vKxW(yW(z)v(x')v(y')v( z ')) « v 3 3 , (43) 

x,y,z,x',y',z' 

so that the signal-to-noise ratio is 

C N {t,p = 0) _ IW (mjv _ 3 /2m w )t (M] 

— 1/2 

It shows that, besides the familiar large time fall off, there is an additional factor of V 3 
due to the noise. This makes the Z 3 wall source worse than the point source statistically. 

To illustrate the above analysis numerically, we show the relative errors of the pseu- 
doscalar, vector, and nucleon correlators from the Wilson fermion (k = 0.154 which corre- 
sponds to the strange quark mass) on 100 quenched Wilson gauge configurations (/3 = 6.0) 



with one Z 3 wall source. As shown in Fig. [TDJ the relative errors for the pion (Fig. 10(a)) 



is about the same as that for a point source which tends to level off in the range t = [7, 11] 



as expected. For the p (Fig. 10(b)) and nucleon (Fig. 10(c)), we observe that, in addition 
to the expected rise of noise-to-signal ratio at large t, the Z 3 wall source result is worse for 
the p and much worse for the nucleon than those of the point source (N.B. The scales for 
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FIG. 10. Relative errors for (a) pion, (b) rho, and (c) nucleon correlators for different Zj, sources. 
These are calculated with Wilson fermion with n = 0.154 and /3 = 6. 



the ordinates are different in the three sub-graphs.) This is consistent with the extra 1/y/Vs 
factor in Eq. ( )44l) for the nucleon. 

The undesirable large variance of the noise estimate is rooted in the noise contamination 
from the neighboring sites which goes down slowly with N n N g in Eq. ( TPTj) because a n is 
more than an order of magnitude larger than a g [28]. To alleviate this difficulty, unbiased 
subtraction of contamination from the neighboring sites has been employed to reduce the 
variance in the calculation of quark loops 57H6l|. In the case of connected insertions, it 



is found that dilution in time slices 23|, |24|] is effective in reducing contamination from the 
nearby time slices. To carry the suggestion further, dilution of space points within a time 
slice should further reduce the variance [63(. To check this idea, we use a Z 3 grid source 
with support on certain spatial grid points in a time slice to calculate the quark propagator. 
This we refer to as the Z 3 grid source. The results are also included in Fig. [101 The 64/8 
points refers to the points of the grid which are separated by 4/8 lattice spacings in each 
spatial direction on a time slice of the 16 3 x 24 lattice. We first observe that they are all 
better than the Z% wall source. In the case of pion, they are even better than the point 



source at large t. For p and nucleon, it is interesting to note from the inserts in Fig. 10(b) 



and Fig. 10(c) that the relative errors of Z 3 grids are smaller than those of the point source 
roughly in the time ranges which are smaller than the corresponding spatial separations of 



2(3 



the grid points. We will return to this point later when we discuss heavy quarks. 

2. Low-mode substitution (LMS) 

From the above study, we conclude that the noise grid source itself does not improve the 
statistics of the hadron correlators over the point source except for the pseudoscalar meson. 
Next, we shall consider low-mode substitution. Since the meson correlators at large time 
separation are dominated by the low-energy modes, substituting the low-frequency part of 
the noise estimated correlator Cll with the exact one Ctj from the eigenmodes, as is outlined 



in Eq. ffT8|) . has been shown to reduce the variance 22|-|24l] . However, the contributions from 
hadrons on different sites of the source time slice are correlated, particularly among the 
nearby neighbors. To see how correlated they are, we plot the relative errors of Cll at large 
time separation for the wall source as well as the grid sources with 1, 8, 64, 128, 256, and 
512 grid points. 

These grid points are spaced uniformly in each spatial direction on the source time slice. 
Plotted in Fig. [11] are relative errors for the pseudoscalar, vector, and axial-vector mesons 
at a quark mass which corresponds to a pion mass ~ 200 MeV. These are calculated from 
the 400 pairs of eigenmodes from 50 32 3 x 64 DWF lattice configurations with mi = 0.004. 
We see that the relative errors are practically the same from the whole wall down to 64 grid 
points. This shows that there is no practical advantage to use the noise wall source, since 
the low-mode contributions from mesons emerging from different sites are highly correlated. 
Cll would be the same with as little as 64 grid points. 

We have learned that if the grid points are too dense, such as close to that of the wall, 
there is large noise contamination from the neighboring sites. On the other hand, the low- 
mode substituted part of the correlator Cll is highly correlated among neighboring points. 
Thus, the clear choice is to reduce the wall source to a grid source with an optimal separation 
between the grid points to reduce noise contamination and, at the same time, not to sacrifice 
the variance reduction from low-mode substitution and the gain in statistics with multiple 
grid points. This would be a many-to-all approach as opposed to the all-to-all approach. 
It may not make much of difference for the pion, but is expected to work better for other 
mesons and the baryons. This optimal choice of grid points on a lattice could depend on 
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(a)Pseudoscalar (b)Vector (c) Axial- vector 

FIG. 11. Relative errors from the low-frequency modes at large t separation (t = 25 for the 
pseudoscalar meson and t = 20 for the vector and axial-vector mesons) with different number of 
grid points. 

the number of eigenmodes and perhaps the hadrons, such as mesons vs baryons, in addition 
to the balance between high- and low-modes. We have not done a detailed analysis in this 
regard. We shall, nevertheless, present results based on 50 DWF configurations on the 
32 3 x 64 lattice (mi = 0.004) with a Z 3 grid source, which has support on 64 points (4 points 
in each spatial direction with 8 lattice spacings apart), and low- mode substitution with 400 
pairs of low-frequency modes plus the zero modes. 



We first plot in Fig. 12(a) the pseudoscalar correlators from the point source, the Z 3 grid 



source with 64 grid points, and the Z 3 grid source with low-mode substitution for the case 



with pion mass at ~ 200 MeV. Also plotted in Fig. 12(b) are their respective relative errors 



as a function of t. We see that the relative errors of the Z 3 grid source with or without low- 
mode substitution is about a factor of 3 smaller than that of the point source in practically 
all the time range. 

A similar situation exists for the strange quark. The results with pseudoscalar mass at 
~ 670 MeV are plotted in Fig. [T3J 



The case for the charm quark is different. We see in Fig. 14(a) for the quark mass around 



the charm, that the correlator from the Z 3 grid source with low-mode substitution levels 
off for t > 12 and its relative error becomes larger those that of the point and the Z 3 grid 
sources for t > 7. This is the classic example where the signal falls off exponentially and the 
noise estimate levels off at some stage due to a constant variance. In this case, the results 
from the low-mode substitution will not be useful. On the other hand, we notice that the 
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FIG. 12. (color online) (a) The pseudoscalar meson correlators from the point (circle), the Z3 grid 
source with 64 grid points (square) and the Z3 grid source with low-mode substitution (diamond) 
are plotted as a function of t. (b) The respective relative errors are plotted as a function of t. 
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FIG. 13. (color online) The same as Fig. [12] for the strange quark mass corresponding to pseu- 
doscalar mass at ~ 670 MeV. 

error from the Z3 grid is about a factor of 3 smaller than that of the point source at large t 
which resembles the situation with the low-mode substitution for the light quarks when the 
time separation is less than that of the spatial separation of the grid points. It is interesting 
to ponder why this is so. Although without a proof, we venture to speculate that since the 
quarks are confined and the charm quark velocity v is about 30% of the speed of light, spatial 
separation Ax of two grid points greater than v times the Euclidean time separation At, i.e. 
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Ax > vAt, is 'space like' in the Minkowski space sense and that would limit the interference 
of the two sources. For the minimal spatial separation of 8 lattice spacings, this limiting 
At is 27 which is close to the mid point of the time extent of 64. This is consistent with 
what we observed earlier for the light quarks that the relative errors of Z 3 grids are smaller 
than those of the point source roughly in the time range shorter than the spatial minimal 
separation of the grid points. In that case, the light quarks are expected to propagate close 
to the speed of light. 
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FIG. 14. (color online) The same as Fig.[12]for the charm quark mass corresponding to pseudoscalar 
mass at ~ 2979 MeV. 

In the case of the vector meson, we see in Fig. [15] that for light quark mass (m^ ~ 200 
MeV), the relative error due to the Z3 grid source with LMS is a factor of 4 to 5 times 
smaller than those of the point and the grid sources. For the strange quark mass region 
(Fig. [TBI . the LMS has smaller relative error in the range t < 22 and then the error becomes 
larger than those of the point and the grid sources beyond this range. The charm quark case 
in Fig. [T7] is similar to that of the pseudoscalar meson in Fig. [TH Although we do not show 
them here, the axial and scalar meson correlators are similar to the vector meson case. 

We plot the results of the nucleon in Fig. [TS] for the quark mass which has a pion mass 
at ~ 300 MeV. The left half of the time range is the nucleon channel and the right half is 
the 5*ii channel. For the nucleon, we see that the relative error of the grid source becomes 
larger than that of the point source at t ~ 7 which is again close to the 8 lattice spacing 
separation of the grids. The points labeled by Z 3 LLL are those with LMS for all three 
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FIG. 15. (color online) The same as Fig. [12] for the vector meson correlator with light quark 

(m-x ~ 200 MeV.) 
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FIG. 16. (color online) The same as Fig. [TB] for the vector meson correlator with strange quark 
mass corresponding to pseudoscalar mass at ~ 670 MeV. 

quarks. Z% LLL+HLL represents those with LMS for two quarks in addition to Z3 LLL. 
They are denned in Eq. (1211) . We observe that the error from Z 3 LLL is smaller than those 
of the point and grid sources. This reverses the situation where the Z 3 grid source itself is 
worse than the point source as we remarked before. With more LMS from Z3 LLL+HLL, 
the relative error is further reduced and is brought down below that of the point source by 
more than a factor of 4 at large time separation. 
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FIG. 17. (color online) The same as Fig. [14] for the vector meson correlator with the charm quark 
mass corresponding to pseudoscalar mass at ~ 2979 MeV. 
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FIG. 18. (color online) The same as Fig. [T2]for the nucleon correlator with quark mass correspond- 
ing to pseudoscalar mass at ~ 300 MeV. 

V. SUMMARY 

To summarize, we have carried out a study of calculating overlap fermion propagators and 
hadron correlators on the 2+1 flavor domain wall fermion configurations on 16 3 x 32, 24 3 x 64, 
and 32 3 x 64 lattices with both deflation in the inversion and low-mode substitution in 
constructing the correlators. 

With HYP smearing and low-mode deflation, we find a speed up from ~ 23 for the 
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16 3 x 32 lattice with 200 pairs of eigenmodes and sim 51 for the 24 3 x 64 lattice with 200 
pairs of eigenmodes to ~ 79 for the larger 32 3 x 64 lattice with 400 pairs of eigenmodes. 
The cost of the overhead for calculating eigenmodes is 4.5, 4.9 and 7.9 propagators for the 
above lattices, respectively, which will be amortized with calculation of propagators for more 
sources. We have calculated the quark mass dependence of the hyperfine splitting and find 
that one can accommodate charm quarks with small 0(m 2 a 2 ) error. Since this is a mixed 
action approach with overlap on DWF sea, we use the finite volume boundary condition 
property of the scalar correlator to estimate the low-energy constant A m j S for finite lattice 
spacing which is needed for the mixed action partially quenched chiral perturbation theory 
extrapolation to the physical point and the continuum limit. The preliminary result of 
^■mix ~ (427MeV) 4 turns out to be small. It only shifts the 300 MeV mixed valence-sea 
pion mass by ~ 10 MeV at a" 1 = 2.32 GeV for the 32 3 x 64 lattice and ~ 19 MeV at 
a' 1 = 1.73 GeV for the 24 3 x 64 lattice. 

We have examined the signal-to-noise issue for the connected hadron correlators from the 
noise source on a time slice and found that the noise wall source is worse than the point 
source for all mesons except the pion. It is worse still for the baryon (and multi-quark 
systems) by a y^4 factor where V3 is the 3-volume of the time slice. The situation can be 
ameliorated by reducing the contamination from neighboring sites with less source points. 
This introduces the idea of a noise grid source with support on some uniformly spaced grid 
points on a time slice. On the other hand, we find that the low-frequency part of the multiple 
hadron source with exact eigenmodes is highly correlated so that, beyond 64 grid points on a 
time slice of the 32 3 x 64 lattice, the relative errors of the meson correlators do not decrease. 
These observations led to a suggestion of a new algorithm for the grid noise with low-mode 
substitution to reduce the variance from noise contamination while addressing the low-mode 
correlation at the same time. 

We decide to use 64 Z 3 grid noise and low-mode substitution with 400 pairs of eigenmodes 
on the 32 3 x 64 lattice with the light sea mass mi = 0.004 to calculate both the meson and 
baryon correlators. We find that for light quarks (pion masses at 200 - 300 MeV), the errors 
of the mesons and nucleon masses can be reduced by a factor of ~ 3 to 4 as compared to 
the point source. In the strange quark region, the statistical errors of the pion and nucleon 
masses can be improved by a factor ~ 3, but it is not much improved for the vector meson. 
We find that the results from low-mode substitution start to degrade beyond the strange 
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quark region. This is due to the fact that the signal falls off quickly at large t and yet the 
variance of the noise estimation of the high-frequency and the mixed high- and low-frequency 
parts of the correlator does not fall. Luckily, the Z 3 grid results are still better than the the 
point source and can reduce the errors of the charmonium masses by a factor of ~ 3. One 
can use it to address the hadrons involving the charm quark. We should point out that the 
interplay between the noise grid source and low-mode substitution is quite general and is 
not restricted to a particular fermion action. 

So far the study of two-point functions has been favorable. The three-point function with 
Z 3 grid source and low-mode substitution will undoubtedly pose a different set of challenges. 
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